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ABSTRACT 

The spatial clustering properties of HI galaxies can be studied using the formalism of 
the halo occupation distribution (HOD) . The resulting parameter constraints describe 
properties like gas richness verses environment. Unfortunately, clustering studies based 
on individual HI galaxies will be restricted to the local Universe for the foreseeable 
future, even with the deepest HI surveys. Here we discuss how clustering studies of 
the HI HOD could be extended to moderate redshift, through observations of fluctua- 
tions in the combined 21cm intensity of unresolved galaxies. In particular we make an 
analytic estimate for the clustering of HI in the HOD. Our joint goals are to estimate 
i) the amplitude of the signal, and ii) the sensitivity of telescopes like the Australian 
SKA Pathfinder to HOD parameters. We find that the power spectrum of redshifted 
21cm intensity could be used to study the distribution of HI within dark matter halos 
at z > 0.5 where individual galaxies cannot be detected. In addition to the HOD 
of HI, the amplitude of the 21cm power spectrum would also yield estimates of the 
cosmic HI content (ilm) at epochs between the local Universe, and redshifts probed 
by damped Lya absorbers. 
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■ 1 INTRODUCTION 



The cosmic star-formation rate has declined by more than 
an order of magnitude in the 8 billion years since z ^ 1 
(Lilly et al. 1996, Madau et al. 1996). Why this decline has 
taken place, and what drove it are two of the most impor- 
tant unanswered questions in our current understanding of 
galaxy formation and evolution. In cold dark matter cos- 
mologies, gas cools and collapses to form stars within grav- 
itationally bound "halos" of dark matter. These galaxies 
can then grow via continued star formation or via mergers 
with other galaxies. As a result the decline in star formation 
at 2 < 1 is presumably accompanied by a decrease in the 
amount of cold gas within halos. 

One of the issues that will need to be addressed in or- 
der to understand the evolution in star formation rate is the 
role of environment. As galaxies of a given baryonic mass 
can only reside within dark matter halos above a particular 
dark matter mass, galaxies are biased tracers of the over- 
all dark matter distribution. The clustering of dark matter 
halos is a known function of their mass (e.g., Sheth, Mo & 
Tormen 2001), and consequently the large-scale clustering 
of galaxies provides an estimate of the typical halo mass 



in which that galaxy population resides. On smaller scales, 
multiple galaxies can reside within a single (< 1 Mpc radius) 
dark matter halo, so that the number of galaxy pairs with 
small spatial separations is a strong function of the number 
of galaxies per halo. One can thus constrain the number of 
galaxies per halo as a function of halo mass by measuring 
both the small (< 1 Mpc) and large (> 1 Mpc) scale clus- 
tering of galaxies (e.g. Peacock & Smith 2000, Zheng 2005). 
The clustering of galaxy samples selected to lie within dif- 
ferent stages of galaxy formation based on their stellar and 
cold gas content therefore has the potential to play a central 
role in our understanding of the star formation history. 

In recent years large galaxy redshift surveys such as 
SDSS and the 2dFGRS have enabled detailed studies of the 
clustering of more than 100,000 optically selected galaxies in 
the nearby universe. By using clustering to understand how 
galaxies populate dark matter halos, key insights may be ob- 
tained into how galaxies grow over cosmic time. The way in 
which stellar mass populates dark matter halos has been de- 
termined through studies of clustering for optically selected 
galaxy samples. A popular formalism for modeling clustering 
on small to large scales is termed the halo occupation distri- 
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bution (HOD; e.g. Peacock & Smith 2000; Seljak 2000; Scoc- 
cimarro et al. 2001; Berlind & Weinberg 2002; Zheng 2004). 
The HOD includes contributions to galaxy clustering from 
pairs of galaxies in distinct halos which describes the cluster- 
ing in the large scale limit, and from pairs of galaxies within 
a single halo which describes clustering in the small scale 
limit. The latter contribution requires a parametrisation to 
relate the number and spatial distribution of galaxies within 
a dark matter halo of a particular mass. It is by constraining 
this parameterisation that observed clustering can be used 
to understand how galaxies are distributed. 

By comparison with the massive optical redshift sur- 
veys, the largest survey of HI selected galaxies contains only 
~ 5000 sources, obtained as part of the HIPASS survey, a 
bhnd HI survey of the southern sky (Barnes et al. 2001). 
Meyer et al. (2007) have studied the clustering of these HI 
galaxies. Their analysis reached the conclusion of weak clus- 
tering of HI galaxies based on parametric estimates of cor- 
relation length, but did not study the clustering in terms 
of the host dark matter halo masses of the HIPASS sample. 
Wyithe, Brown, Zwaan & Meyer (2009) analysed the clus- 
tering properties of HI selected galaxies from the HIPASS 
survey using the formalism of the halo occupation distribu- 
tion. They found that the real-space clustering amplitude 
for HIPASS galaxies is significant on scales below the virial 
radius associated with the halo mass required to reproduce 
the clustering amplitude on large scales, indicating that sin- 
gle halo pairs are contributing a 1-halo term. However the 
resulting parameter constraints show that satellite galax- 
ies make up only 10% of the HIPASS sample. HI satellite 
galaxies are therefore less significant in number and in terms 
of their contribution to clustering statistics than are satel- 
lites in optically selected galaxy redshift surveys. 

These results from HOD modeling of HI galaxy clus- 
tering therefore quantify the extent to which environment 
governs the HI content of galaxies in the local Universe and 
confirms previous evidence that HI galaxies are relatively 
rare in overdense environments (Waugh et al. 2002; Cortes 
et al. 2008). Wyithe et al. (2009) found a minimum halo 
mass for HIPASS galaxies at the peak of the redshift distri- 
bution of Al ~ 10^^ M0 (throughout this paper we refer to 
the halo mass as M and the HI mass as Mm), and showed 
that less than 10% of baryons in HIPASS galaxies are in the 
form of HI. Their analysis also revealed that the fingers-of- 
god in the redshift space correlation function are sensitive 
to the typical halo mass in which satellite galaxies reside, 
and indicated that the HI rich satellites required to produce 
the measured 1-halo term must be preferentially in group 
rather than cluster mass halos. 

As described above, the clustering of HI galaxies can be 
studied at z — using HIPASS. However in the future with 
the advent of the Square Kilometer Array (SKA) and its 
pathfinders the volume and redshift range over which clus- 
tering of HI galaxies can be studied will greatly increase. On 
the other hand, these studies will still be limited to moder- 
ate redshifts of z < 0.5 owing to the sensitivity required 
for detection of even the most massive HI galaxies. For ex- 
ample, the Australian SKA pathfinder (ASKAP) will detect 
the most massive HI galaxies only out to z ~ 0.7 in the 
deepest integrations (Johnston et al. 2007). At higher red- 
shifts, we argue that progress on the clustering of HI galaxies 
may be made by measurement of fluctuations in the com- 



bined surface-brightness of unresolved HI galaxies (Wyithe 
& Loeb 2009; Chang et al. 2008; Wyithe 2008). A survey of 
21cm intensity fluctuations at redshifts beyond those where 
individual galaxies can be detected would therefore measure 
the modulation of the cumulative 21cm emission from a large 
number of galaxies. The detectability of the 21 cm PS after 
reionization was discussed by Khandai et al. (2009). These 
authors used an N-body and simulation to predict the sta- 
tistical signal of 21 cm fluctuations in the post-reionization 
IGM, and estimated its detectability. Khandai et al. (2009) 
find that a combination of these arrays ofl'er good prospects 
for detecting the 21 cm PS over a range of redshifts in the 
post reionization era. Importantly, a statistical detection of 
21 cm fluctuations due to discrete, unresolved clumps of neu- 
tral gas has already been made (Pen et al. 2008) through 
cross-correlation of the HIPASS (Barnes et al. 2001) 21 cm 
observations of the local universe with galaxies in the 6 de- 
gree field galaxy redshift survey (Jones et al. 2005). This 
detection represents an important step towards using 21 cm 
surface brightness fluctuations to probe the neutral gas dis- 
tribution in the IGM. 

The majority of the discussion in the literature con- 
cerning 21cm fluctuations in the low-redshift Universe 
has centered around their utility for cosmological con- 
straint (Wyithe & Loeb 2009; Chang et al. 2008; Loeb & 
Wyithe 2009; Bharadwaj, Sethi & Saini 2009). In this paper 
we concentrate on the possibility of studying the distribu- 
tion of HI within dark matter halos, on scales accessible to 
traditional configurations for radio interferometers (which 
do not include the very short baselines required to study 
the 21cm fiuctuations in the large scale, linear regime). On 
these scales recent simulations suggest that the smoothed 
HI density field is highly biased owing to non-linear gravi- 
tational clustering (Bagla & Khandai 2009). Following from 
this prediction we discuss the possibility of studying the oc- 
cupation of dark matter halos by HI at high redshift via 
21cm intensity mapping. As a concrete example we consider 
the potential of ASKAP with respect to constraining the HI 
HOD. We concentrate on z ^ 0.7, as this is the redshift at 
which ASKAP no longer has the sensitivity to study indi- 
vidual galaxies. Our goal is not to provide a detailed method 
for extracting detailed HOD parameters from an observed 
power spectrum of redshifted 21cm fluctuations. This would 
require calibration against N-body simulations, which is pre- 
mature at this time. Rather, we present an analytic model 
for the 21cm power spectrum in the HOD, and investigate 
which of its properties could be constrained by observations 
using a telescope like ASKAP 

The paper is organised as follows. We begin by sum- 
marising the formalism for the HOD model, and introduce 
HOD modeling of 21cm intensity fluctuations in § [2] We 
discuss the potential sensitivity of ASKAP to these fluctu- 
ations in § |3l We then present our forecast constraints on 
HOD parameters in § |3] and describe estimates of the HI 
mass function in § [5] We summarise our findings in § [G] In 
our numerical examples, we adopt the standard set of cos- 
mological parameters (Komatsu et al. 2009) , with values of 
f2m = 0.24, ilh = 0.04 and JIq — 0.76 for the matter, baryon, 
and dark energy fractional density respectively, h = 0.73, for 
the dimensionless Hubble constant, and as — 0.81 for the 
variance of the linear density field within regions of radius 
8/i"^Mpc. 
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2 INTENSITY MAPPING AND THE HI HOD 

We begin by reviewing the halo occupation distribution 
formalism for galaxies (e.g. Peacock & Smith 2000; Seljak 
2000; Scoccimarro et al. 2001; Berlind & Weinberg 2002; 
Zheng 2004) which we describe only briefly, referring the 
reader to the above papers for details. The technique of sur- 
face brightness mapping will not allow resolution of individ- 
ual galaxies, but rather the measurement of fluctuations in 
the surface brightness of unresulved galaxies. However we 
utilise a halo model formalism where galaxies are traced, 
rather than a form where the density field is a continuous 
function. This is because the HI is found in discrete galaxies, 
and treating individual galaxies allows us to explicitely cal- 
culate the HI mass weighted galaxy bias. The HOD model is 
constructed around the following simple assumptions. First, 
one assumes that there is either zero or one central galaxy 
that resides at the centre of each halo. Satellite galaxies are 
then assumed to follow the dark matter distribution within 
the halos. The mean number of satellites is typically as- 
sumed to follow a power-law function of halo mass, while 
the number of satellites within individual halos follows a 
Poisson (or some other) probability distribution. The two- 
point correlation function on a scale r can be decomposed 
into one-halo (^ih) and two-halo (^2h) terms 



(1) 



corresponding to contributions to the correlation function 
from galaxy pairs which reside in the same halo and in two 
different halos respectively (Zheng 2004). 

The 2-halo term can be computed as the halo correla- 
tion function weighted by the distribution and occupation 
number of galaxies within each halo. The 2-halo term of the 
galaxy power spectrum (PS) is 



1 



(it) 

dM—{N)Mb{M)y^{k,M) 



(2) 

where Pm is the mass PS and i/g is the normalised Fourier 
transform of the galaxy distribution, which is assumed to fol- 
low a Navarro, Frenk & White (1997; NFW) profile (see e.g. 
Seljak 2000; Zheng 2004). Here fig is the mean number den- 
sity of galaxies. We assume the Sheth Tormen (1999) mass 
function dn/dM using parameters from Jenkins et al. (2001) 
throughout this paper. To compute the halo bias b(M) we 
use the Sheth, Mo and Tormen (2001) fitting formula. The 
quantity Mmax is taken to be the mass of a halo with separa- 
tion 2r. The 2-halo term for the correlation function follows 
from 



1 

2^ 



Pgg (fc)fc 



sin kr 
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(3) 



In real space the 1-halo term can be computed using 
(Berlind & Weinberg 2002) 



dM 



dn {N{N~1)}m 



dM 



2i?vir(M) 



F' 



2-Rvir 



(4) 



where {N{N — 1))m is the average number of galaxy pairs 
within halos of mass Al. The distribution of multiple galaxies 
within a single halo is described by the function F'{x), which 
is the differential probability that galaxy pairs are separated 



by a dimensionless distance x = r/Rvir- As is common in the 
literature, we assume that there is always a galaxy located at 
the center of the halo, and others are regarded as satellite 
galaxies. The contribution to F' is therefore divided into 
pairs of galaxies that do, and do not involve a central galaxy, 
and is computed assuming that satellite galaxies follow the 
number-density distribution of an NFW profile. With this 
assumption, the term in the integrand of equation Q reads 

{N{N - 1))mF'{x) 

Hiv-i)MF4(.)+ <^^'^^f-^^^- i^4(x). (5) 

where F'{x) is the pair-number- weighted average of the 
central-satellite pair distribution F^g{x) and the satellite- 
satellite pair distribution F!;s{x) (see, e.g., Berlind & Wein- 
berg 2002; Yang et al. 2003; Zheng 2004), 



2.1 21cm intensity mapping of HI clustering 

The HOD method of estimating and modeling the cluster- 
ing of HI galaxies will not work at redshifts beyond z ~ 0.7, 
where even the most luminous galaxies will not be detectable 
in HI for the foreseeable future. Instead, observations of sur- 
face brightness fluctuations in 21cm intensity arising from 
the combined signal of a large number of unresolved galax- 
ies could be used to measure the clustering of HI galaxies. 
Studies of 21cm surface brightness fluctuations over a large 
volume will be made possible by the widefield interferom- 
eters now coming on line, and will allow the HI properties 
of galaxies to be studied over a greater range of redshifts. 
Indeed, it has been argued that lack of identification of in- 
dividual galaxies is an advantage when attempting to mea- 
sure the clustering of the HI emission, since by not impos- 
ing a minimum threshold for detection, such a survey col- 
lects all the available signal. This point is discussed in Pen 
et al. (2008), where the technique is also demonstrated via 
measurement of the cross-correlation of galaxies with unre- 
solved 21cm emission in the local Universe. 

The situation is analogous to mapping of the three- 
dimensional distribution of cosmic hydrogen during the 
reionization era through the 21cm line (Furlanetto, Oh & 
Briggs 2007; Barkana & Loeb 2007). Several facilities are 
currently being constructed to perform this experiment (in- 
cluding MWaQ, LOFAR0, PAPER 0, 21CMAEI) and more 
ambitious designs are being planned (SKA0). During the 
epoch of reionization, the PS of 21cm brightness fluctuations 
is shaped mainly by the topology of ionized regions. How- 
ever the situation is expected to be simpler following reion- 
ization of the intergalactic medium (IGM; z < 6) - when 
only dense pockets of self-shielded hydrogen, such as damped 
Lya absorbers (DLA) and Lyman-limit systems (LLS) sur- 
vive (Wyithe & Loeb 2008; Chang et al. 2007; Pritchard & 
Loeb 2008). These DLA systems are thought to be the high 
redshift equivalents of HI rich galaxies in the local Universe 
(Zwaan et al. 2005b). We do not expect 21cm self absorp- 
tion to impact the level of 21cm emission. This conclusion 

^ http:/ /www. haystack. mit.edu/ast/arrays/mwa/ 
^ http://www.lofar.org/ 

^ http:/ /astro. berkeley.edu/~dbacker/EoR,/ 
■* http://web.phys. emu. edu/^past/ 
^ http://www.skatelescope.org/ 
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is based on 21cm absorption studies towards damped Lya 
systems at a range of redshifts between 2: ~ and z ~ 3.4, 
which show optical depths to absorption of the back-ground 
quasar flux with values less than a few percent (Kanekar 
& Chengalur 2003; Curran et al. 2007). Moreover, damped 
Lya systems have a spin temperature that is large relative 
to the temperature of the cosmic microwave background ra- 
diation, and will therefore have a level of emission that is 
independent of the kinetic gas temperature (e.g. Kanekar & 
Chengalur 2003). Thus the intensity of 21cm emission can 
be directly related to the column density of HI. 

2.2 Modeling the power spectrum of 21cm 
fluctuations 

As mentioned above, low spatial resolution observations 
could be used to detect surface brightness fluctuations in 
21cm emission from the cumulative sum of HI galaxies, 
rather than from individual sources of emission. Here the PS 
is a more natural observable than the correlation function, 
since a radio interferometer records visibilities that directly 
sample the PS. In the linear regime the 21cm PS follows 
directly from the PS of fluctuations in mass Pm(fc) (Wyithe 
& Loeb 2009) 

PHi(fe) ~Tb2sm(fe)MPm(fc), (6) 

where % = 23.8 [(1 -I- 2:)/10] 5 mK is the brightness tem- 
perature contrast between the mean IGM and the CMB 
at redshift z, and {b)M is the HI mass weighted halo bias. 
Note that we have used the subscript hi rather than the 
more usual 21 in order to reduce confusion with the sub- 
scripts for the 1-halo and 2-halo PS terms. The fraction 
of hydrogen that is neutral is described by the parameter 
xui = nHi/(0.76f2b). We assume xui ~ 0.01 (correspond- 



ing to Qm 



3 X 10" 



Zwaan et al. 2005a) throughout 



this paper. The resulting PS is plotted in the right panel of 
Figure [T] (dashed line). The constant Tj, hides the implicit 
assumptions that the 21cm emission from the galaxies is not 
self absorbed, and that the spin temperature is much larger 
than the temperature of the cosmic microwave background. 
On small scales a model is needed to relate HI mass to halo 
mass. To achieve this we modify the HOD formalism as out- 
Uned below. 



2.2.1 HOD model for 21cm fluctuations 

Since surface brightness fluctuations depend on the total 
HI mass within a halo rather than on number counts of 
individual galaxies, the number of galaxies, and the number 
of galaxy pairs per halo in the HOD formalism need to be 
weighted by the HI mass per galaxy. In analogy with the 
HOD formalism, we distribute this mass between central 
and satellite galaxies. We define (Mhi,c)m and (Mhi,s)m to 
be the mean HI mass of central galaxies and of the combined 
satellite galaxies within a halo of mass M respectively. 

To compute the 2-halo PS, we replace (A'^) m in equa- 
tion ^ with the mean value of the total HI mass in a halo 
of mass M, i.e. {Mm)M = {Mm,c)M + (A/hls>a/, yielding 



,2h 



1 

.Phi jo 



X 



M„ 



fin 

dM—{Muj)Mb{M)y^{k,M) 



where, pni is the mean density of HI contributed by all galax- 
ies in the IGM. The 2-halo term ^2h,Hi('') follows from sub- 
stitution into equation Q. 

To compute the 1-halo term we again weight the number 
of galaxies by their HI mass. In difference from the calcula- 
tion of the 1-halo term for galaxy clustering, the distribu- 
tion of satellite masses will be important in addition to the 
number. This aspect of the HOD modeling will require sim- 
ulation for a proper treatment (e.g. Bagla & Khandai 2009). 
However for the purposes of our analysis it is sufficient to 
assume that most of the satellite HI for a halo mass M is 
contained within satellites of similar mass (as would be the 
case for a steep power-law mass function with a lower cutoff 
for example). We therefore further define (mHi.s) m to be the 
mean HI mass of satellite galaxies within a halo of mass M. 
The coefficients in equation ((5)1 are then modified to yield 

((iV - l){mm,s)M X (A/hi,c)a/) = (Mhi,c)m(A/hi,s)m (8) 
and 

{{N - l)(mHi..)M X (TV - 2){mm,s)M) _ {Mni,.)li 



2 

respectively. To calculate this second term we have noted 
that for a Poisson distribution of galaxies ((TV — 1)(TV — 
2))m = {{N-1))m, and have assumed that (A/s)m = ((A^- 
1))m X {ms)M. The modified expression for the 1-halo term 
therefore become^ 



-'b ^HI 

27rr-2p2; 



+ 



dn 



(A/hi,c)m(Mhi,s>mF4 



F' 



2-Rvir 



-dM.(lO) 



27?viryj RvAM) 
The correlation function follows from ^hi(»") ~ [Tl^x\i -f 

Clh,Hl(r)] -I- 6h,Hl(r). 

In order to evaluate this expression the HI mass occu- 
pation of a halo of mass A4 must be parameterised, and is 
obviously quite uncertain. For illustration, we choose the fol- 
lowing polynomial form, with a minimum halo mass (Afmin) 
and characteristic scale (Afi) where satellites contribute HI 
mass that is comparable to the central galaxy. 



(A/hi,c)a/ oc 
and 

(A/hi,s)m = 







Afei, 




M 



if M > M„ 
otherwise. 



if M > Afmin 

otherwise. 



(11) 



(12) 



The average HI mass within a halo of mass M > Afmin is 
therefore 



(Mhi)ai = (AfHi,c)A/ + (A/hi,s)a/ oc KP 



1 + 



M 
Ah 



(7) 



(13) 

Note that the constant of proportionality in equations (|11|) 
and p3[l is not specified but cancels with the same factor 



^ Note that the correlation function has dimensions of mK, and 
therefore does not have the usual interpretation of probability 
above random for finding a galaxy pair of separation r. 
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in Phi in equations (O and (I10|) . From experience of the 
galaxy HOD there will be degeneracy between the parame- 
ters Ml and 7s. We therefore make the simplification of set- 
ting 7c = 7s = 7 in our parameterisation for the remainder 
of this paper. The left panel of Figure [T] shows the real-space 
correlation function a,t z = 0.7 for an HOD model with pa- 
rameters 7 = 0.5, Afmin = 1O"M0 and Ah = IO^^Mq. This 
model serves as our fiducial case throughout this paper, and 
is motivated by the parameters derived from estimates for 
HIPASS galaxies ( Wyithe et al. 2009) . In particular we note 
the value of 7 = 0.5 which is smaller than unity. This value 
encapsulates the assumption that smaller halos have more 
HI, and agrees with the conventional wisdom that galaxy 
clusters are HI poor. In the local Universe 7 ~ 0.5 is found 
to describe the relation between HI and dynamical masses 
of HIPASS galaxies (Wyithe et al. 2009). Aside from this 
motivation the fiducial model is otherwise arbitrary. 

2.2.2 Redshift space modeling of the 21cm power spectrum 

Since a radio interferometer directly measures the 3 di- 
mensional distribution of 21cm intensity it is more pow- 
erful to work in redshift space, where line-of-sight infall 
(Kaiser 1987) can be used to break the degeneracy between 
neutral fraction and galaxy bias (Wyithe 2008). In addition 
to gravitational infall the shape of the redshift space PS 
will be complicated by peculiar motions of galaxies within 
groups or clusters, which produce the so-called fingers-of- 
god in the redshift space correlation function. In the case of 
21cm fluctuations, the internal velocities of HI in galaxies 
will also contribute to the fingers-of-god. In this paper we 
use the combination of the real-space HOD 21cm PS 

PnMk) = 47r J dr^Hi{r)^^r^dr, (14) 

and the dispersion model to estimate the redshift space PS 
including these efl'ects. The dispersion model is written 

Pz,Hl(fcpcrp, fclos) = PR,Hl(fc)(l + Pfi^il + k^l^i^ /2)-^ 

(15) 

where jj. is the cosine of the angle between the line-of-sight 
and the unit-vector corresponding to the direction of a par- 
ticular mode, fcperp = kfi, feios = k^l - fi^, (5 = / {h)M 
and (6)m is the average HI mass weighted halo bias. The 
quantity is a constant which describes a "typical" ve- 
locity dispersion for galaxies and parametrises the promi- 
nence of the fingers-of-god. Simulations indicate a value of 
fffc ~ 650/(1 -I- 2)kms"^ (Lahav & Suto 2004). We note that 
the redshift space PS could have been generated through a 
2-d Fourier transform of the redshift space correlation func- 
tion computed within the HOD model using the formalism 
described in Tinker (2007), allowing additional constraints 
on HOD parameters to be placed based on the prominence 
of the fingers-of-god as in Wyithe et al. (2009). In particular 
the assumption of 7 < 1 as well as the potential lack of HI in 
clusters would lead to reduced prominance of the fingers of 
god. However in the absence of current data we have taken 
the simpler approach of parameterising the fingers-of-god 
using cTfe. 

The left panel of Figure [2] shows the resulting redshift 
space PS for the fiducial model. The large scale motions 
induced by infall into overdense regions can be seen as an 



extension of the PS along the line-of-sight at small k, while 
the fingers of god are manifest as a compression at large k. 
The right panel of Figure [1] shows the corresponding spher- 
ically averaged redshift space PS (solid line) 

^z,^i(^) ~ J' Pz.iii{kipcYp.,k\oQ^dkpcYpdk\tjs. (1^) 

For comparison, the dotted lines in the right hand panel of 
Figure 2 show the 1-halo and 2-halo contributions to the 
spherically averaged redshift space PS. 

The spherically averaged redshift space PS can be com- 
pared with the linear real-space 21cm PS estimated based 
on the neutral fraction xhi ~ 0.01 and the linear mass PS 
(dashed line, equation[6ll. On large (linear) scales the spher- 
ically averaged PS is larger in redshift space than in real 
space. This is analogous to the excess power seen in red- 
shift space clustering of galaxy surveys (Kaiser 1987), and 
is due to an increase in the 21cm optical depth owing to 
velocity compression (Barkana & Loeb 2005) towards high 
density regions. On small scales there is excess power above 
the linear theory expectation owing to the inclusion of the 
non-linear 1-halo term. The 21cm PS shows a steepening 
at large k owing to the meiss weighting in the 1-halo term. 
This steepening is also seen in the simulations of Bagla & 
Khandai (2009). 

2.2.3 variation of the 21cm PS with HOD parameters 

Figure |3] illustrates the sensitivity of the clustering and the 
21cm PS to variations in the HOD parameters. The solid 
lines repeat the fiducial model from Figure [T] For compar- 
ison, the dotted, dashed and dot-dashed lines show varia- 
tions on this model, with 7 = 0.4, Mmin = lO^^M© and 
Ml ~ lO^^M© respectively (with the remaining parameters 
set to their fiducial values in each case). On the largest scales 
the clustering amplitude is most sensitive to Mmin (dashed 
lines), which enables the typical host halo mass of HI to be 
measured from the PS amplitude (Wyithe 2008). Lowering 
the value of Mmin (while keeping Mi fixed) implies a smaller 
fraction of HI in satellites, and hence a relative decrease of 
power on small scales. A smaller value of 7 also leads to a rel- 
ative decrease of power on small scales because the flatter 
power-law preferentially places mass in the more common 
low mass halos (with M < Mi), and so lowers the fraction 
of HI in satellites (dotted lines). Conversely, a smaller value 
of Afi leads to a larger fraction of HI in satellite systems, and 
hence an increase of small scale power (dot-dashed lines). 

The variation in shape and amplitude of the 21cm PS 
implies that parameter values for a particular HOD model 
could be constrained if the PS were measured with sufficient 
signal-to-noise. In the remainder of this paper we therefore 
first discuss the sensitivity of a radio interferometer to the 
21cm PS, and then estimate the corresponding constraints 
on the 5 parameters in our HOD model that could be placed 
using observations of 21cm intensity fluctuations. 



3 SENSITIVITY TO THE 21CM PS 

In this paper we estimate the ability of a telescope like 
ASKAP to measure the clustering of 21cm intensity fluc- 
tuations, and hence to estimate HOD parameters, and the 
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Figure 1. An example of the correlation function and PS of intensity fluctuations for a hypothetical HOD model at ^ = 0.7. Left-hand 
panel: the model correlation function. Right-hand panel: The corresponding spherically averaged redshift space PS (solid line). In each 
case the 1-halo and 2-halo terms are plotted as dotted curves. For comparison we plot the spherically averaged sensitivity within pixels 
of width Ak = fc/10 for an a radio interferometer resembling the design of ASKAP (thick gray line). We also show the real-space 21cm 
PS assuming a linear mass-density PS (dashed line). For calculation of observational noise an integration of 3000 hours was assumed, 
with a multiple primary beam total field of view corresponding to 30(1 -I- z)^ square degrees (see text for details). The cutoff at large 
scales is due to foreground removal within a finite frequency band-pass. 



power spectrum 





kp„p [1/cMpc] 



kperp [l/cMpc] 



Figure 2. The redshift space PS of intensity fiuctuations corresponding to the example in Figure [T] Left-hand panel: The redshift- 
space PS assuming a dispersion model with a/^ = 650/(1 2)kms^^. Right-hand panel: Contours of the signal-to-noise (separated by 
factors of vTO) within pixels of width Afc = fe/10. The thick contour corresponds to a signal-to- noise per pixel of unity. For calculation 
of observational noise an integration of 3000 hours was assumed, with a multiple primary beam total field of view corresponding to 
30(1 -)- z)^ square degrees (see text for details). The cutoffs at large and small scales perpendicular to the line-of-sight are due to the 
lack of short and long baselines respectively. The cutoff at large scales along the line-of-sight is due to foreground removal within a finite 
band-pass. 



total HI content of the Universe. The latter quantity, which 
is not available from the clustering of resolved galaxies, could 
be used to bridge the gap in measurements of Qm (the cos- 
mic density of HI relative to the critical density) between 
the local Universe where this quantity can be determined 
from integration of the HI mass function, and z > 2 where 
it is measured from the column density through counting of 
damped Lya absorbers. 

To compute the sensitivity APHi(fcpcrp, kios) of a radio- 
interferometer to the 21cm PS, we follow the procedure 
outlined by McQuinn et al. (2006) and Bowman, Morales 
& Hewitt (2007) [see also Wyithe, Loeb & Geil (2008)]. 
The important issues are discussed below, but the reader 
is referred to these papers for further details. The uncer- 
tainty comprises components due to the thermal noise, and 
due to sample variance within the finite volume of the ob- 



servations. We also include a Poisson component due to 
the finite sampling of each mode (Wyithe 2008), since the 
post-reionization 21cm PS is generated by discrete clumps 
rather than a diffuse IGM. We consider a telescope based 
on ASKAP. This telescope is assumed to have 36 dish an- 
tenna with a density distributed as p(r) oc within a 
diameter of 2km. The antennae are each 12m in diameter, 
and being dishes are assumed to have physical and effec- 
tive collecting areas that are equal. We assume that fore- 
grounds can be removed over 80MHz bins, within a band- 
pass of 300MHz [based on removal within 1/4 of the avail- 
able bandpass (McQuinn et al. 2006)]. Foreground removal 
therefore imposes a minimum on the wave-number acces- 
sible of fc ~ 0.02[(1 + 2;)/1.5]"^Mpc"\ although access to 
the large scale modes is actually limited by the number of 
short baselines available. An important ingredient is the an- 
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Figure 3. Examples of correlation functions {left panel) and spherically averaged redshift space PS (right panel) of intensity fluctuations 
for different HOD models at z = 0.7. The solid lines repeat the fiducial model from Figure [T] (parameters listed in the top-right corner of 
the left panel). For comparison, the dotted, dashed and dot-dashed lines show variations on this model, with 7 = 0.4, Mmin = lO^'^M© 
and Ml = IO^^Mq respectively (with the remaining parameters set to their fiducial values in each case). For comparison with the PS, 
we also plot the spherically averaged sensitivity within pixels of width Ak = fc/10 for an a radio interferometer resembling the design of 
ASKAP (right panel, thick gray line). 



gular dependence of the number of modes accessible to the 
array (McQuinn et al. 2006). ASKAP is designed to have 
multiple primary beams facilitated by a focal plane phased 
array. We assume 30 fields are observed simultaneously for 
3000 hr each, yielding ~ 30(1 + z)^ square degrees [where 
the factor of (1 + z)'^ originates from the frequency depen- 
dence of the primary beam]. The signal-to- noise for obser- 
vation of the PS in the left panel of Figure [2] is shown in 
the right panel of Figure [5] A telescope like ASKAP would 
be most sensitive to modes of fcpcrp ~ 0.1 — IMpc^^ and 
kioB ~ 0.03 — 0.3Mpc~^. The spherically averaged signal-to- 
noise (within bins of Afc — k/10) is shown in the right panels 
of Figures [1] and [3] (grey curves) . Comparison of the noise 
curve with the variability of the PS amplitude and shape 
among different HOD models for the 21cm PS (Figure [3]) 
indicates that a telescope like ASKAP would be sufficiently 
sensitive to generate constraints on the HOD. Moreover, the 
spatial scale on which the array would be most sensitive 
corresponds to wave-numbers where we expect 1-halo and 
2-halo contributions to be comparable, indicating that such 
observations may constrain HOD model parameters. This 
statement is quantified in the next section. 



trial models 



4 CONSTRAINTS ON HOD PARAMETERS 
FROM 21CM INTENSITY MAPPING 

Based on our estimate of the sensitivity to the 21cm PS we 
forecast the ability of ASKAP to constrain the HI HOD. 
To begin, we assume the fiducial model Pz^m{kpcrp, fcios), as 
shown in Figure (2] and estimate the accuracy with which 
the parameters could be inferred. Our HOD model for the 
21cm PS has five parameters, Mmin, Mi, 7, xm and at- 
For combinations of these parameters (Mmin, Mi, 7, xm, 
afc) that differ fi-om the fiducial case (IO^Mq, IO^^Mq, 0.5, 
0.01, 650/(l-|-z)km/s), we compute a trial model for the real 
space correlation function. We then use this to calculate the 
chi-squared of the difference between the fiducial and the 



2 

X = 



EE 



(17) 



-Pz.HI ^ -Pz,Hl(fcpcrp, fcloslMmin, Ml, 7, Xm, (Tfc, (Jg) 
A^ZjHI (^perp , ^los) 



and hence find the likelihood 

£(Mn,in, Ml, 7, XHi, cTfc) = j dcrg^exp (18) 

The uncertainty introduced through imperfect knowledge of 
the PS amplitude (which is proportional to the normal- 
ization of the primordial PS, ag) is degenerate with xh 
(Wyithe 2008). For this reason the uncertainty in ag has 
been explicitly included in equation (|18[l . We assume a Gaus- 
sian distribution dp/das for erg with erg = 0.81 ± 0.03 (Ko- 
matsu et al. 2009). 

Figure |4] shows an example of forecast constraints on 
HOD model parameters for a telescope like ASKAP, assum- 
ing a 3000hr integration of a single pointing 30(1 + z)^ 
square degrees] centered on z = 0.7. Results are presented 
in the upper panels of Figures |4] which shows contours of 
the likelihood in 2-d projections of this 5-parameter space. 
Here prior probabilities on \ogxm, log Mmin, log Afi, 7 and 
(jfc are assumed to be constant. The contours are placed at 
60%, 30% and 10% of the peak likelihood. The lower panels 
show the marginalised likelihoods on the individual param- 
eters iHi, Mmin, Ml and 7. 

A deep integration of a single pointing for a telescope 
like ASKAP would place some constraints on the minimum 
mass (the projected uncertainty on Mmin is ~ 0.5 dex), 
and measure the relationship between HI and halo mass (a 
~ 20% constraint on 7). In addition to these constraints on 
the halo occupation distribution of HI, observations of the 
21cm PS would also provide a measurement of the global 
neutral fraction (or equivalently Hhi), which would be con- 
strained with a relative uncertainty of 20% at z ~ 0.7. This 
indicates that 21cm intensity fluctuations could be used to 
measure the evolution of rini from z ~ 1 to the present day, 
even though ASKAP will not detect individual galaxies at 
z > 0.7 (Johnston et al. 2007). Measurement of JIhi based 
on the redshift space PS would be complimentary to the 
detection of individual rare peaks, which could facilitating 
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Figure 4. Example of forecast constraints on HOD model parameters from 21cm intensity fluctuations, assuming a SOOOhr integration 
of a 30(1 + z)-^ square degree field with an array based on the ASKAP design centered on z = 0.7 (further details in the text). The 
upper panels show contours of the likelihood in 2-d projections of the 5-parameter space used for the HOD modeling of 21cm intensity 
fluctuations, while the lower panels show the marginalised likelihoods on individual parameters. Here prior probabilities on 7, logxm, 
logM^in and log Mi are assumed to be constant. The contours are placed at 60%, 30% and 10% of the peak likelihood. The position of 
the dot indicates the peak likelihood in the 5-dimensional parameter space (i.e. the input model). 



a direct estimate of the cosmic HI mass density (Bagla & 
Khandai 2009). 



5 HI CONTENT AND THE HI MASS 
FUNCTION 

The combination of measurements for a^ni and the HOD pa- 
rameters 7, Mmin and Ml indicates that the HI mass func- 
tion (summing both central and satellite galaxies) could be 
approximated from the HOD using 



dn dn dM 
dMni ^ dM dMm ' 



(19) 



where Mm = CM^(1 
evaluated from 



[M /Mi)~') and the constant C is 



C 



0.76(nb/nM)p n 



dM 



(20) 



The left panel of Figure [5] shows the mass function for 
the fiducial model in Figure [1] (thick grey curve) as well 
as ten HI mass functions computed assuming parameters 
drawn at random from the joint probability distribution [oc 
-^min'^r^2;gj£(Minin,Mi,7,2;Hi,o-j.)], projectious of which 
are shown in Figure H) While the values over which the HI 
mass range extends in these realisations shows some vari- 
ability, the possibility of constraints on 7 and xhi implied 
by Figure |4] mean that the overall shape of the HI mass 
function could be quite well constrained by observation of 
a redshifted 21cm PS. In the central panel of Figure [5] we 
show the corresponding the HI mass functions for central 
galaxies [obtained by instead substituting A/hi = CM'']. In 
this case the range of realisations is much larger, which can 



be traced to the degeneracy between Mmin and Mi seen in 
Figure |4] 

In addition to the HI mass function, it would also be 
possible to constrain the fraction of hydrogen within galaxies 
that is in atomic form. This immber is given by 



/hi = 



Fcoi(Mmi„ 



(21) 



where fcoi(A/min) is the fraction of dark-matter that is col- 
lapsed in halos more massive than A/min. From the upper 
left panel of Figure U we see that there is a degeneracy be- 
tween xm and Afmin. Larger neutral fractions correspond to 
lower values of Afmin and hence larger collapsed fractions. 
As a result the ratio /hi would be very well constrained, 
as shown by the likelihood distribution in the right hand 
panel of Figure [5] (which is based on the distributions in 
the left panel of Figure [S}. The evolution of /, which can 
also be measured locally from clustering with a value of 



10" 



(Wyithe, Brown, Zwaan & Meyer 2009) 



will provide an important ingredient for studies of the role 
of HI in star formation. 



6 SUMMARY 

Due to the faintness of HI emission from individual galax- 
ies, even deep HI surveys will be limited to samples at rel- 
atively low redshift [z < 0.7) for the next decade. However 
these surveys will be able to detect fluctuations in 21cm in- 
tensity produced by the ensemble of galaxies out to higher 
redshifts, using observational techniques that are analogous 
to those being discussed with respect to the reionization 
epoch at z > 6 (e.g. Furlanetto, Oh & Briggs 2006). As 
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Figure 5. Examples of the range for the total halo (left) and central galaxy (central panel) HI mass functions. In addition to the fiducial 
case (thick lines, corresponding to the model in Figure [T}, ten HI mass functions are shown in each case with parameters drawn from 
the probability distribution for HOD parameters in Figure |4] (dot-dashed lines). In the right panel we show forecast for corresponding 
constraints on /hi- A SOOOhr integration of a 30(1 + 2)^ square degree field with an array based on the ASKAP design centered on 2 = 0.7 
(corresponding to Figure |4]l was assumed. The prior probability on log/jji was assumed to be constant. 



a result, studies of HI galaxy clustering could be extended 
to redshifts beyond those where individual HI galaxies can 
be identified through the use of 21cm intensity fluctuations. 
To investigate this possibility we have described an approx- 
imate model for the power spectrum of 21cm fluctuations, 
which is based on the halo occupation distribution formal- 
ism for galaxy clustering. Our goal for this paper has been to 
use this model to estimate the expected amplitude and fea- 
tures of the 21 cm power-spectrum, rather than to present 
a detailed method for extracting the halo occupation of HI 
from an observed power-spectrum. This latter goal would 
require numerical simulations (e.g. Bagla & Khandai 2009). 

To frame our discussion we have made forecasts for 
ASKAP, specifically with respect to the use of the 21cm 
power spectrum as a probe of the occupation of HI in dark 
matter halos. We have chosen z = 0.7 for our estimates, 
which is the redshift at which individual galaxies are no 
longer detectable with ASKAP in deep integrations. We 
have shown that a telescope based on the design of ASKAP 
will have sufficient sensitivity to yield estimates of the HI 
halo occupation. Because 21cm intensity fluctuations com- 
bine the integrated HI from all galaxies (not just those de- 
tected as individual sources), the clustering amplitude is 
proportional to the total HI content of the Universe. We 
find that an array with the specifications of ASKAP could 
yield estimates of the global HI density which have a rela- 
tive accuracy of ~ 20%. Clustering measurements in 21cm 
surface brightness could therefore be used to make measure- 
ments of the global HI content in the currently unexplored 
redshift range between the local Universe, and surveys for 
damped Lya absorbers in the higher redshift Universe. 

The cosmic star-formation rate has declined by more 
than an order of magnitude in the past 8 billion years (Lilly 
et al. 1996, Madau et al. 1996). Optical studies paint a some- 
what passive picture of galaxy formation, with the stellar 
mass density of galaxies gradually increasing and an increas- 
ing fraction of stellar mass mass ending up within red galax- 
ies that have negligible star-formation (e.g.. Brown et al. 
2008). On the other hand, the combination of direct HI ob- 
servations at low redshift (Zwaan et al. 2005; Lah et al 2007) 
and damped Lya absorbers in the spectra of high-redshift 
QSOs (Prochaska et al. 2005) show that the neutral gas den- 



sity has remained remarkably constant over the age of the 
universe. The evolutionary and environmental relationships 
between the neutral gas which provides the fuel for star for- 
mation and the stars that form are central to understanding 
these and related issues. The study of the halo occupation 
distribution of HI based on 21cm fluctuations has the poten- 
tial to allow these studies to be made at redshifts beyond 
those where individual galaxies can be observed in HI with 
either existing or future radio telescopes. 
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